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Abstract. The problem of intermediates in the fossil record has been fre- 
quently discussed ever since Darwin. The extent of 'gaps' (missing transitional 
stages) has been used to argue against gradual evolution from a common an- 
cestor. Traditionally, gaps have often been explained by the improbability of 
fossilization and the discontinuous selection of found fossils. Here we take an 
analytical approach and demonstrate why, under certain sampling conditions, 
we may not expect intermediates to be found. Using a simple null model, we 
show mathematically that the question of whether a taxon sampled from some 
time in the past is likely to be morphologically intermediate to other samples 
(dated earlier and later) depends on the shape and dimensions of the under- 
lying phylogenetic tree that connects the taxa, and the times from which the 
fossils are sampled. 
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1. Introduction 

Since Darwin's book On the Origin of Species by Means of Natural Selection, 
or the Preservation of Favoured Races in the Struggle for Life 2 s , there has been 
much debate about the evidence for continuous evolution from a universal common 
ancestor. Initially, Darwin only assumed the relatedness of the majority of species, 
not of all of them; later, however, he came to the view that because of the similarities 
of all existing species, there could only be one 'root' and one 'tree of life' (cf. 
[llj). All species are descended from this common ancestor and indications for 
their gradual evolution have been sought in the fossil record ever since. Usually, 
the improbability of fossilization or of finding existing fossils was put forward as 
the standard answer to the question of why there are so many 'gaps' in the fossil 
record. Such gaps have become popularly referred to as 'missing links', i.e. missing 
intermediates between taxa existing either today or as fossils. 

Of course, the existence of gaps is in some sense inevitable: every new link gives 
rise to two new gaps, since evolution is generally a continuous process whereas 
fossil discovery will always remain discontinuous. Moreover, a patchy fossil record 
is not necessarily evidence against evolution from a common ancestor through a 
continuous series of intermediates - indeed, in a recent approach, Elliott Sober (cf. 

applied simple probabilistic arguments to conclude that the existence of some 
intermediates provides a stronger support for evolution than the non-existence of 
any (or some) intermediates could ever provide for a hypothesis of separate ancestry. 
Moreover, some lineages appear to be densely sampled, whereas of others only few 
fossiliferous horizons are known (cf. [10]). This problem has been well investigated 
and statistical models have been developed to master it (see e.g. [B], [7]), [T!2]). 

In this paper, we suggest a further argument that may help explain missing links 
in the fossil record. Suppose that three fossils can be dated back to three different 
times. Can we really expect that a fossil from the intermediate time will appear 
(morphologically) to be an 'intermediate' of the other two fossils? We will explore 
this question via a simple stochastic model. 

In order to develop this model, we first state some assumptions we will make 
throughout this paper: firstly, we will consider that we are sampling fossil taxa of 
closely related organisms and which differ in a number of morphological character- 
istics. We assume this group of taxa has evolved in a 'tree-like' fashion from some 
common ancestor; that is, there is an underlying phylogenetic tree, and the taxa 
are sampled from points on the branches of this tree. 

It is also necessary to say how morphological divergence might be related to 
time, as this is important for deciding whether a taxon is an intermediate or not. 
In this paper, we make the simplifying assumption that, within the limited group of 
taxa under consideration (and over the limited time period being considered), the 
expected degree of morphological divergence between two taxa is proportional to the 
total amount of evolutionary history separating those two taxa. This evolutionary 
history is simply the time obtained by adding together the two time periods from 
the most recent common ancestor of the two taxa until the times from which each 
was sampled (in the case where one taxon is ancestral to the other, this is simply the 
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time between the two samples) . This assumption on morphological diversity would 
be valid (in expectation) if we view morphological distance as being proportional 
to the number of discrete characters that two species differ on, provided that two 
conditions hold: (i) each character has a constant rate of character state change 
(substitution) over the time frame T that the fossils are sampled from, and (ii) T 
is short enough that the probability of a reverse or convergent change at any given 
character is low. We require these conditions to hold in the proofs of the following 
results. We will discuss other possible relations of morphological diversification and 
distance towards the end of this paper. 



■n 



■T 2 



Figure 1: When the tree consists of only 
one lineage from which samples are taken at 
times Ti , T2 and T3, then clearly the distance 
di^z is always larger than di,2 and ^2,3- Con- 
sequently, -£1,3 > maxji?!^,^^}- 




Figure 2: For samples taken from different lin- 
eages of a tree, the distance of one particular 
sample from time T\ to the one of T3 can be smaller 
than the distance of either of them to the sample 
taken at time T2. Yet in expectation we always have 
Si, 3 > max{Ei,2, ^2,3} for two-branch trees. For 
more complex trees this can fail as we show in Ex- 
ample 12.71 



The simplest scenario is the case where the three samples all lie on the same 
lineage, so that the evolutionary tree can be regarded as a path (cf. Figure [T]). In 
this case, the path distance (and hence expected morphological distance) between 
the outer two fossils is always larger than the distance that either of them has 
from the fossil sampled from an intermediate time. But for samples that straddle 
bifurcations in a tree, it is quite easy to imagine how this intermediacy could fail; 
for example, if the two outer taxa lie on one branch of the tree and the fossil 
from the intermediate time lies on another branch far away (cf. Figure [2]) • But 
this example might be unlikely to occur, and indeed we will see that if sampling 
is uniform across the tree at any given time, in expectation the morphological 
distances remain intermediate even for this case (cf. Figure Yet for more 
complex trees, this expected outcome can fail, and perhaps most surprisingly, the 
distance between the earliest and latest sample can, in expectation, be the smallest 
of the three distances in certain extreme cases. 

Thus, in order to make general statements, we will consider the expected degree 
of relatedness of fossils sampled randomly from given times. Our results will depend 
solely on the tree shape (including branch lengths) of the underlying tree and the 
chosen times. 
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2. Results 

We begin with some notation. Throughout this paper, we assume a rooted binary 
phylogenetic tree to be given with an associated time scale < T\ < T 2 < T3. The 
number of Ti-lineages (of lineages extant at time Tj) is denoted by n^. For instance, 
in Figure [31 the number n\ of Ti-lineages is 3, whereas the numbers n 2 and 713 of 
T2- and T3-lineages are both 5. If not stated otherwise, extinction may occur in 
the tree. Every bifurcation in the tree is denoted by bi, where bo is the root. Note 
that in a tree without extinction, the total number of bifurcations up to time T3 
(including the root) is n.3 — 1. For every bi let U denote the time of the occurrence 
of bifurcation bi. We may assume that the root is at time to = 0. 

Now, for every bi, we make the following definitions: 

^' k :="k + fora11 j,k£{l,2,3},j^k 

where rij i denotes the number of descendants the subtree with root bi has at time 
Tj to the left of its root bi, and rij^ is defined analogously for the descendants on 
the right hand side of 6, . 

It can be seen that bifurcations for which at least one branch of offspring dies 
out in the same interval where the bifurcation lies always have P/' fc -value 0. Con- 
sequently, if either to < U < T\ or T\ < ti < T 2 or T% < ti < T 3 and one of ftj's 
branches becomes extinct in the same interval, respectively, then P?' is for all 
j, k. Note that the number P?' denotes the number of different paths in the tree 
from time Tj to time in the subtree with root bi and in which no edge is taken 
twice. 

Example 2.1. Consider the tree given in Figure fJl Here, the values Pf' for 
bifurcation b\ corresponding to time t\ are P\' 2 — n\ j-n^ 1 = 1-2 + 1-1 = 

3, Pi' 3 = 1 • 3 + 1 • 1 = 4 and P*' 3 = 1 • 3 + 2 • 1 = 5. 



In the sampling, select uniformly at random one of the Ti-lineages as well as one 
of the Tj-lineages to get the expected length E^j of the path connecting a lineage 
at time Ti with one at time Tj in the underlying phylogenetic tree. Then, the ex- 
pectation that a fossil from the intermediate time Ti also will be an intermediate 
taxon of two taxa taken from T\ and T3, respectively, refers to the assumption 
that E\£ > max{Ei i 2, ^2,3}- We will show in the following lemma that this last 
inequality can fail and describe the precise condition for this to occur. Moreover, 
we later show that E\^ can be strictly smaller (!) than both E1.2 and -£2,3 - that 
is the temporally most distant samples can, on average, be more similar than the 
temporally intermediate sample is to either of the two. 

Note that if P-' is 0, the corresponding branch does not contribute to the expected 
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Figure 3: A rooted binary phylogcnctic tree with three times Ti, T2, T3 at which taxa have been sampled. 
The dotted branches refer to taxa that do not contribute to the expected distances from one of these 
times to another and thus are not taken into account. On the other hand, bifurcation 62 at time £2 
shows that extinction may have an impact on the expected values. Such branches have to be considered. 



distance from one time to another. We can therefore assume without loss of gen- 
erality that all bifurcations bi have at least one descendant on their left-hand side 
and at least one on their right-hand side, each in at least one of the times T\,T2 1 T 3 . 
In Figure [3l branches that therefore need not be considered are represented with 
dotted lines. 



In order to simplify the statement of our results, for all bifurcations bi set 



Q 



3,k 



2 • P. 



TljUk 



for all j,fce{l,2,3},jVfc- 



Lemma 2.2. Given a rooted binary phylogenetic tree with times < T\ < T2 < T3 
and the root at time to = 0. Then, Ei 3 < E\% if and only if 



T 3 -T 2 < £ (Q-' - Q\' )U 

i:0<t 4 <Ti 



Proof. 

(1) 



nin 3 



n 3 (T 3 - Ti) +J2 [P?' 3 (T 3 -T 1 + 2{T 1 - U)) 



every 73-lineage 
has an ancestor 
in T\ 



z:0<ti<Ti 



Po' 3 (T 3 + Tr) 



ways along the root 
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In the above bracket, the three summands refer to different paths from time T\ 
to time T 3 . The first summand belongs to those paths that go directly from T\ 
to T 3 and thus have length T 3 —T\. There are 713 such ways as every T3-lineage 
has an ancestor in T\. The second summand sums up all paths going along one of 
the bifurcations 6; for i ^ 0. For every i, there are by definition exactly P i ' such 
paths. Similarly, the third summand refers to all paths along the root bo, whose 
length is determined by taking the distance from T\ to the root plus the distance 
from there to T3. 



As there are altogether nin 3 different paths from T\ to T3 in the tree, we have: 
(2) n 3 + ]T /',' '• • /'! ' n.n,. 

i:0<t i <T 1 



Then, by HI and (J2]), we get 



E 1<3 = — . — •( ni n 3 T 3 + {n x n 3 - 2n 3 )T x - 2 • ^ P^U ) , 

121 n3 \ i:0<t,<Ti / 



and thus 

(3) E 1 , 3 = T 3 + T ^T X - V Ql'% 

i:0<t i <T 1 

Analogously, 

(4) E ia = T 2 + 7 ^-^ Tl - Y, 

Til ' 

i:0<ti<Ti 

Thus, with ([3]) and ([!]), we can conclude: 



£1,3 < £1,2 & t 3 - E <3!' 3 *i < T 2- E Qi 1,2 ti. 

i:0<ti<Ti i:0<*i<Ti 

^ r 3 -T 2 < E (Q]' 3 - Q]' 2 )t, 

i:0<ti<Ti 



□ 

Corollary 2.3. For a given tree there exist times < T\ < T2 < T 3 such that 
Ei <3 < E 1<2 if and only if E (Qi ~ Qi )*» > °- 

i:0<*i<Ti 



Proof. If E (Qi ' 3 - Ql^Yi < 0. t hen by LemmaOwe need T 2 > T 3 in order 

i:0<ti<Ti 

to get 251,3 < 2?i, 2 . Hence, there are no values < T± < T 2 < T 3 such that T 3 - T 2 
fulfills the required condition, and so E X:3 > E\$ for all choices of T t . Conversely, 
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suppose E (Qi ,S ~ Ql^U > 0- Thcn , select T U T 2 with < T x < T 2 and set 

i:0<ti<Ti 

i:0<ti<Ti 

Then, T 3 > T 2 and 

n-T 2 = 1 -- Yl iQ\' a -Q\ a )ti< £ (Q-' 3 -q!'% 

i:0<ti<Ti i:0<ti<Ti 

By Lemma E21 this choice of < T x < T 2 < T 3 leads to Ei, a < E x , 2 . 

□ 

Corollary 2.4. // either (i) n\ = 2 or (ii) no extinction occurs in the tree and 
n 2 = n 3 , then £1,3 > £4,2- 

Proof. (i) Note that if n\ = 2, obviously only one bifurcation, say 6j (for some i 
such that < <j < Ti), contributes to the number n\ of lineages at time Ti, 
all the branches added by additional bifurcations become extinct before T\. 
Thus: P. 1 ' 3 , p}< 2 =i and P, 1 ' 3 , P, 1,2 = for all i / I 

Analogously to the proof of Lemma 12.21 we have for n\ — 2: n\n 3 — 2n 3 = 
n 3 + P.' and n\n 2 = 2n2 = n 2 + P, ' . Thus, n 2 = P, ' and n 3 — P. ' . 

Therefore, Q 1 ' 2 = Q 1 ' 3 = ±- and Q\ 2 = Q)' 3 = for all i ^ i. Thus, 

E (QY ~ Qi 2 )U = an d it follows with Corollary [U that E 1>3 > E^ 2 . 

i:0<ti<Ti 

(ii) In this case, obviously Q\ 2 = Q]' 3 for all i : < ti < T\ and therefore 
E (Q!' 3 ~ Qi' 2 )^ = °- Thus > b y CorollaryrOl #1,3 > E lt3 . 

i:0<*i<Ti 

□ 



Lemma 12.21 essentially states that the expected degree of relatedness from taxa 
of time T\ to taxa of time T3 can be larger than the one to taxa of time T 2 , but it 
requires the distance from T 2 to T3 to be "small enough" . Whether such a solution 
is feasible can be checked via Corollarv l2.3l Lemma |2~21 shows already how the role 
of intermediates depends on the times the fossils are taken from. Corollary 12. 4IT ) 
on the other hand shows how the tree itself has an impact on the expected values: 
if the tree shape (including branch lengths) is such that at time T\ only two taxa 
exist, then the just mentioned scenario cannot happen as the condition of Corollary 
El is not fulfilled. 

However, we can prove an even stronger result, namely that not only E\ t3 < E\^ 2 
is possible, but E 13 < mix\{Ei 2 , E 23 ] can be obtained for a suitable choice of times 
T\,T 2 ,T 3 . For this, we need the following lemma. 
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Lemma 2.5. Given a rooted binary phylogenetic tree with times < T\ < T 2 < T 3 
and the root at time to = 0. Then < Ei,z if and only if 

712 77-1 i:0<*i<Ti f.TxKt^T-z 



Proof. As in the proof of Lemma [2721 we have (c/. ([3])) 

(5) E 1 , 3 = T 3 + ^T 1 - £ Ql>% 

1 i:0<t;<Ti 



(6) Analogously, E 2<3 = T 3 + ^—^T 2 - V Ql% 

i:0<ti<T 2 



Thus, Ei j3 < i?2,3 if and only if 

E tf s * < E Qf 3 u, 

i:0<t;<Ti 2 i:0<t;<T 2 



which holds precisely if 



7^2 n-\ — ' L — ' 

i:0<ti<Ti i:Ti<ti<T 2 

□ 



With the help of the two lemmas we can now state the following theorem. 

Theorem 2.6. Given a rooted binary phylogenetic tree with times < T\ < T 2 < T 3 
and the root at time 0. Then, E\ j3 < min{i?i.2, £^2,3} */ and only if the following 
two conditions hold: 

(i)T 3 -T 2 < £ (Q\-3-Q^) U , 

i:0<ti<Ti 

(a) ^t 2 - sjj==T! > E (Q?' 3 - Q, 1 ' 3 )** + E Q 2 i' 3 u. 

i:0<ti<T! i:T!<U<T 2 



Proof. The Theorem follows directly from Lemmas 12.21 and 12.51 □ 



The following example demonstrates the influence of times < T\ < T 2 < T3 
according to the above theorem. 
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Example 2.7. Consider again Figured 

(1) Assume t x = 15, Ti = 100, t 2 = 107, t 3 = 109, T 2 = 110, T 3 = 130. Then, 
Ei t 2 = 137.33, E2.3 — 155.28 and E\^ = 155.33. Hence, for this choice of 
times, we have E±^ > max{i?i : 2, ^2,3}- 

(2) Consider the same times as in the previous case, but choose T% = 129 
instead of T2 = 110. This means to move T2 further away from T\ and 
closer to T3. This change is enough to give completely different expected 
values: £?i 2 = 156.33, £^,3 = 166.68 and £^3 = 155.33. Hence, for this 
choice of times, we have £1,3 < minj-Ei^, £^2,3}- 



3. Discussion 

The analysis of the fossil record provides an insight into the history of species 
and thus into evolutionary processes. Stochastic models can provide a useful way 
to infer patters of diversification, and they form a useful link between molecular 
phylogenetics and paleontology [5] . Such models would greatly benefit from incor- 
poration of potential fossil ancestors and other extinct data points to infer patterns 
of evolution. In this paper we have applied a simple model-based phylogenctic ap- 
proach to study the expected degree of similarity between fossil taxa sampled at 
intermediate times. 

'Gaps' in the fossil record are problematic [10] as they can be interpreted as 'miss- 
ing links'. Therefore, numerous studies concerning the adequacy of the fossil record 
have been conducted (see, for example, [3], [9], [13]), and it is frequently found that 
even the available fossil record is still incompletely understood. This is particularly 
true for ancestor-descendant relationships (see, for instance, [4], [5]). For exam- 
ple Foote [5] reported the probability that a preserved and recorded species has 
at least one descendant species that is also preserved and recorded is on the order 
of 1%-10%. This number is much higher than the number of identified ancestor- 
descendant pairs. Thus, it remains an important challenge to recognize such pairs 
PP. This is also essential with regard to ancestor- intermediate-descendant triplets, 
as it is possible that there are in fact fewer 'gaps' than currently assumed, i.e. 
that intermediates are present but not yet recognized. Such issues have an impor- 
tant bearing on any conclusions our results might imply concerning the testing of 
hypotheses of continuous morphological evolution, or concerning the shape of the 
underlying evolutionary tree based on the non-existence of certain intermediates. 

Another challenge is to investigate different phylogenetic models for describ- 
ing the expected degree of morphological separation between different fossil taxa 
sampled at different times. Our findings strongly depend on the assumption that 
morphological diversification is proportional to the distance in the underlying phy- 
logenetic tree. This is justified if morphological difference is proportional to the 
number of differing discrete characters, that each of these characters changes at a 
constant rate over the time period of sampling, and that homoplasy is rare. This 
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last assumption requires the rate of character change to be sufficiently small in re- 
lation to the time period of the sampling - the appearance of reverse or convergent 
character states will lead to a more concave (rather than linear) relationship be- 
tween morphological divergence and path distance. A similar concave relationship 
might be expected for continuous morphological evolution as described by neutral 
Brownian- motion. 

Thus, the impact of different assumptions on the role of intermediates could be 
further investigated. But even if we assume that diversification is proportional to 
time, there may be other ways to measure 'distance' that could be usefully explored 
- for instance, one could define the distance between two taxa to be the maximum 
(rather than the sum) of the two divergences times of the taxa back to their most 
recent common ancestor. This definition of distance allows the degree of relatedness 
to be higher for taxa on the same clade than for other taxa. In this case, there exist 
analogous results to Lemmas 12.21 and 12.51 (results not shown) , but the formulae are 
somewhat different, particularly for Lemma 12.51 
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